High-dimensional visualisation for novelty detection

Authors

  • David A. Clifton
  • Peter R. Bannister
  • Lionel Tarassenko
  • Lei A. Clifton
  • Srini Sundaram
  • Steve King
Abstract

A key step in the application of novelty detection techniques to high-dimensional data is the exploration of relationships that are typically not obvious when dimensions are examined independently. Visualisation techniques allow such exploration of the structure of the data set by mapping high-dimensional data into lower dimensionality for inspection. This paper discusses the application of the NeuroScale visualisation method for construction of this mapping, which is commonly employed due to its ability to interpolate between training examples into areas of data space not previously encountered. 1) We show that there are disadvantages to using this visualisation method for extrapolation, as is commonly performed when visualising previously-unseen test data which are “abnormal”. 2) We describe a method for ensuring consistent projection of such previously-unseen “abnormal” examples. 3) We show how the proposed technique can also be used to provide a visualisation of high-dimensional decision boundaries, as are typically applied to models of normality in high-dimensional novelty detection cases. An example from a probabilistic model of normality is presented, in which a decision boundary is computed using Extreme Value Statistics and then visualised in two dimensions, showing how the method can be used to communicate the results of novelty detection and allow analysis of high-dimensional “abnormal” data.
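
As a rough illustration of what "mapping high-dimensional data into lower dimensionality" is trying to preserve, the sketch below scores a candidate 2-D projection by how well it keeps pairwise distances from the original space. This is not the paper's implementation: the stress formula is assumed here as a generic Sammon-style criterion of the kind NeuroScale is usually described as minimising, and the function names and toy data are hypothetical.

```python
import math

def pairwise_distances(points):
    """Euclidean distance between every pair of points."""
    n = len(points)
    return [[math.dist(points[i], points[j]) for j in range(n)] for i in range(n)]

def stress(high_dim, low_dim):
    """Sum over pairs i < j of the squared difference between the
    high-dimensional and low-dimensional inter-point distances
    (assumed criterion, see lead-in)."""
    d_high = pairwise_distances(high_dim)
    d_low = pairwise_distances(low_dim)
    n = len(high_dim)
    return sum((d_high[i][j] - d_low[i][j]) ** 2
               for i in range(n) for j in range(i + 1, n))

# Hypothetical toy data: four 3-D points and two candidate 2-D projections.
data = [(0, 0, 0), (1, 0, 0), (0, 1, 0), (1, 1, 0)]
faithful = [(0, 0), (1, 0), (0, 1), (1, 1)]   # drops the constant third axis
collapsed = [(0, 0), (0, 0), (0, 0), (0, 0)]  # maps every point to the origin

print("faithful projection stress: ", stress(data, faithful))
print("collapsed projection stress:", stress(data, collapsed))
```

A projection that preserves the data's structure scores near zero, while one that discards it scores higher. The abstract's concern is what happens when a mapping fitted on "normal" data in this way is asked to place "abnormal" points far from the training set.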

Similar resources

Novelty Detection in Large-Vehicle Turbocharger Operation

We develop novelty detection techniques for the analysis of data from a large-vehicle engine turbocharger in order to illustrate how abnormal events of operational significance may be identified with respect to a model of normality. Results are validated using polynomial function modelling and reduced dimensionality visualisation techniques to show that system operation can be automatically cla...

Specific and Generic Modelling for Jet Engine Novelty Detection

This paper describes a method for modelling the normal behaviour of a modern military gas-turbine engine, for which fault data are particularly scarce. Signatures of fundamental tracked order vibration amplitude are used to characterise the normal operation of the engine, with models formed in high-dimensional space trained on a fleet of engines of the same class. These models are shown to be a...

Outlier Detection and Visualisation in High Dimensional Data

The outlier detection problem has important applications in the field of fraud detection, network robustness analysis, and intrusion detection. Such applications have to deal with high dimensional data sets with hundreds of dimensions. However, in high dimensional space, the data are sparse and the notion of proximity fails to retain its meaningfulness. Many recent algorithms use heuristics suc...

SVMs for novel class detection in Bioinformatics

Novelty Detection techniques might be a promising way of dealing with high-dimensional classification problems in Bioinformatics. This paper presents the early results of the use of a One-Class SVM approach to detect novel classes in two Bioinformatics databases. The results are compatible with the theory and inspire further investigations.

Chapter 2: The Self-Organizing Map as a Tool in Knowledge Engineering

The Self-Organizing Map (SOM) is one of the most popular neural network methods. It is a powerful tool in visualization and analysis of high-dimensional data in various engineering applications. The SOM maps the data on a two-dimensional grid which may be used as a base for various kinds of visual approaches for clustering, correlation and novelty detection. In this chapter, we present novel me...

Publication date: 2008